home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
ftp.cs.arizona.edu
/
ftp.cs.arizona.edu.tar
/
ftp.cs.arizona.edu
/
tsql
/
doc
/
tsql.mail
/
000070_@ICINECA.CINECA…s64.cineca.it _Tue Apr 6 18:44:49 1993.msg
< prev
next >
Wrap
Internet Message Format
|
1996-01-31
|
8KB
Received: from icineca.cineca.it by optima.cs.arizona.edu (5.65c/15) via SMTP
id AA22404; Tue, 6 Apr 1993 09:43:38 MST
Received: from deis64.cineca.it by ICINECA.CINECA.IT (IBM VM SMTP V2R2)
with TCP; Tue, 06 Apr 93 18:43:19 SET
Received: from [137.204.57.79] (deis79) by deis64.cineca.it (4.1/SMI-4.1)
id AA13038; Tue, 6 Apr 93 18:45:27 +0200
Date: Tue, 6 Apr 93 18:44:49 +0100
From: (Fabio Grandi) <fabio@deis64.cineca.it>
Message-Id: <67495.fabio@deis64.cineca.it>
To: tsql@cs.arizona.edu
Subject: Valid/Transaction times orthogonality
The following message regards valid and transaction time
orthogonality in temporal queries.
It continues a correspondence started with Shashi Gadia which
- I agree - may be of general interest.
=============================================================================
My original questions arose from Gadia and Sunil's email on March 31 (EM1),
which was a contribution to the TSQL benchmark discussion
(as well, such questions may be part of the discussion
around a next phase of the benchmark, when transaction-time
queries are going to be included).
First of all, as far as the Gadia's reply on April 5 (EM2)
is concernded, I completely agree with his view.
It also helped to clarify some points of EM1 which
seemed to me a little questionable.
I make no doubt that:
1) PURE valid time query: "when did John work in toys"
(i.e. valid time query in a valid-time database)
and:
2) PURE transaction time query: "when did John work in toys"
(i.e. transaction time query in a transaction-time database)
are orthogonal and are to be expressed in a similar way
in a (mono)temporal query language. Consequently, we can also accept
a common English formulation for them. Only the (*implied*)
time dimension varies.
In a bitemporal database, I also share the view that
the counterpart of (1) and (2) is:
3) PURE bitemporal query: "when did John work in toys"
(i.e. bitemporal query in a bitemporal database)
Although (3) can have the same expression as (1) and (2)
in a (bi)temporal query language (provided that bitemporal
rather than monotemporal timestamps are retrieved),
a little more questionable thing is the most "natural" meaning
of its English formulation in this framework.
Well, my initial observations addressed to Gadia arose from the examples
contained in section "transaction time queries" of EM1, which
seemed to me somewhat confusing.
EM1 suggested the *addition* of transaction time queries
to the tsql benchmark, based so far on a valid-time model,
so I assumed the scope of EM1 to be bitemporal and, thus,
I understood a bitemporal context as I was reading the examples:
4) valid time query: "when did John work in toys"
(valid time query in a bitemporal database)
5) transaction time query: "when was John believed to have worked in toys"
(transaction time query in a bitemporal database (?))
In my opinion, IN A BITEMPORAL CONTEXT, there's not a clear orthogonality
between (4) and (5), such that their corresponding sql-queries are identical.
For instance, it can be noticed that (2) and (4) have the same English
formulation though they are supposed to represent different kinds of queries.
I feel that the most "natural" meaning of the question word "when"
- concerning some "fact" and not some "belief" -
in a bitemporal context is to denote a historical inquiry,
along valid time, in a database which is kept up-to-date
by a human being (and not by a wizzard) as best as he can (now).
Hence, I feel that (4) usually implies in a "natural" way the
transaction time qualifier *as of now* and its more explicit meaning is:
4a) retrieve the valid timestamps of the data
telling that John worked in toys
from the database restricted to the transaction timepoint "now"
The transaction-time counterpart of (4a) could be as well:
5a) retrieve the transaction timestamps of the data
telling that John worked in toys
from the database restricted to the valid timepoint "now"
which is orthogonal to (4a) but rather different, I believe, from (5).
On the other hand, the English query "when was John believed
to have worked in toys" may be something "more" than a simple
transaction time query in a bitemporal database.
I think it presents some very interesting semantic issues
which should be taken into account in future versions of the tsql benchmark.
First, if we say "John worked in toys" in a bitemporal context,
we likely mean he did it [in reality] in some valid time span.
Second, if we say "when was John believed" in a bitemporal context,
we likely mean he was believed [according to the information in the DB]
in some transaction time span, which is the target of our query.
Therefore, combining the two things, our query seems to look for
the transaction time spans in which it was stored in the database
that John worked in toys in some valid time span. Since such a valid
time span is not *explicit* in the query, we have to assume some kind
of *default* to fully map our query onto bitemporal data.
I would suggest that a "natural" default for the valid time span
could be in this case the same time span used for transaction time.
If we want to specify a *different* valid time span, we have to express
it explicitly, as in the query "when was John believed to have worked
in toys in 1990", which could be answered as:
pi ( sigma ( R ) )
Transaction time Name=John, Dept=toys, Valid time=1990
if R is the bitemporal relation containing John's and departments' data.
Assuming the suggested default ("coherent" values of transaction and
valid time), the query can be detailed as:
5b) retrieve all the (transaction/valid) time points T such that
John worked in toys
from the data valid at valid time T
from the database as of transaction time T
In this case, the qualifying timestamps must be retrieved
from the data laying along the *diagonal* of the time quadrant
whose axes are transaction and valid time.
This kind of retrieval should be supported by a temporal query language,
since its semantics seems to be interesting from an application point
of view.
Let us consider, for instance, a database storing employees' data
including an attibute "Qualification" on which the salary value depends.
Let us make the following (simplified) assumptions:
a) salaries are payed monthly, in advance, on the 1st of each month;
b) for each employee, the value to be payed on the 1st of "M"
is evaluated as a function of the attribute value Employee.Qualification
valid on the 1st of "M" as of the 1st of "M",
which is the correct value for month "M" as best known on the 1st of "M"
(no wizzard can forecast future retroactive updates);
c) due to a mistake of a terminal operator, a wrong value, say "Q2",
of the John's Qualification was stored in the database;
d) some time later, John (or someone else) noticed the mistake
and a retroactive modification was issued to correct the
Qualification value;
e) the accounting department is then informed that John's salaries
must be audited;
f) the accounting department has to know for which months
the John's salary must be recomputed; this can be answered
by the query:
"when John's Qualification was believed to have been Q2"
This query can be detailed as:
retrieve all the months "M" such that
John's Qualification=Q2
from the data valid on the 1st of "M"
from the database as of the 1st of "M"
As a matter of fact, it should be noticed that
the Qualification value Q2 (valid on and as of the 1st of "M")
was used on the 1st of "M" to compute the wrong salary amount
which was *actually* payed to John, according to (b).
I hope that this example (and my suggestions)
would come out more clear than obscure.
I would greatly appreciate any comments on the topic.
Sincerely,
Fabio Grandi.